Explainability Using Bayesian Networks for Bias Detection: FAIRness with FDO

نویسندگان

چکیده

In this paper we aim to provide an implementation of the FAIR Data Points (FDP) spec, that will apply our bias detection algorithm and automatically calculate a FAIRness score (FNS). metrics would be themselves represented as FDOs, could presented via visual dashboard, machine accessible (Mons 2020, Wilkinson et al. 2016). This enable dataset owners monitor level their data. is step forward in making data FAIR, i.e., Findable, Accessible, Interoperable, Reusable; or simply, Fully AI Ready First may discuss context topic with respect Deep Learning (DL) problems. Why are Bayesian Networks (BN, explained below) beneficial for such issues? Explainability – Obtaining directed acyclic graph (DAG) from BN training provides coherent information about independence variables base. generic DL problem, features functions these variables. Thus, one can derive which dominant system. When customers business units interested cause neural net outcome, DAG structure both source importance clarify model. Dimension Reduction — joint distribution associations. The latter play role reducing induce engine: If know random X,Y conditional entropy X Y low, omit since its nearly entire information. We have, therefore, tool statistically exclude redundant Tagging Behavior section less evident those who work domains vision voice. some frameworks, labeling obscure task (to illustrate, consider sentiment problem many categories overlap). tag data, rely on within datasets generate probability. Training BN, when initialize empty DAG, outcomes target parent other nodes. Observing several tested examples, reflect “taggers’ manners”. therefore use DAGs not merely purpose model development learning but mainly taggers policy improve it if needed. conjunction Casual inference Causal Inference highly developed domain analytics. It offers tools resolve questions hand, models commonly do and, real-world raises. There need find framework conjunction. Indeed, frameworks already exist (e.g., GNN). But mechanism merges typical problems causality common. believe flow, described paper, good direction achieving benefits Fairness Bias networks, essence, they reveal columns (or items) modify noise bias, address faults column However, assume have set measure (Purian 2022). networks prominence (as “cause” “effect” data), thus allow us assess overall database. What Networks? motivation using (BN) learn dependencies graphs (DAG), mimic Perrier (2008)). follows probabilistic factorization distribution: node V depends only parents (a r.v independent nodes free node). Real-World Example present way engine tabular python package bnlearn. Since project commercial, variable names were masked; thus, meaningless names. Constructing Our begin by finding optimal DAG. import bnlearn bn = bn.structure_learning.fit(dataframe) now has adjacency matrix found follow: print(DAG[ 'adjmat' ]) outcome form Fig. 1a. Where rows sources (namely arc left elements row) targets (i.e., header receives arcs). drawing obtained get following image: 1b. see rectangle still points arrows itself two discussion Rauber 2021). more variables, I increased number Adding provided new row “False”). following: 1c. So, how construct Now train parameters. Code-wise perform follows: model_mle bn.parameter_learning.fit(DAG, dataframe, methodtype= 'maximumlikelihood' ) change ‘ maximulikelihood ’ bayes beyond. factorized distributions DAG’s structure. given variable: 1d. code create presentation 2. Discussion theoretical concepts usage constructing approximated addition, example end learning: parameters maximum likelihood estimation (MLE) methods, performing inference. metrics, also visualised monitored, taking care FAIRness.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Building Detection Using Bayesian Networks

This paper further explores the uses of Bayesian Networks for detecting buildings from digital orthophotos. This work differs from current research in building detection in so far as it utilizes the ability of Bayesian Networks to provide probabilistic methods for evidence combination and, via training, to determine how such evidence should be weighted to maximize classification. In this vein, ...

متن کامل

Using tilt for automatic emphasis detection with Bayesian networks

This paper proposes a new framework for emphasis detection from natural speech, where emphasis refers to a word or part of a word perceived as standing out from its surrounding words. Labeling emphatic words from speech recordings plays a significant role not only in human-computer interactions, but also in building speech corpus for expressive speech synthesis. Many previous researches use the...

متن کامل

Methods for representing bias in Bayesian networks

Bias is intrinsic to observation and reasoning in both humans and automated systems. Bayesian Belief Networks (BBNs) are well suited for representing these biases and for applying bias models to improve reasoning practices, but there are a number of different ways that bias can be represented and integrated into reasoning processes using BBNs. In this paper, we describe a number of methods to m...

متن کامل

Utilizes the Community Detection for Increase Trust using Multiplex Networks

Today, e-commerce has occupied a large volume of economic exchanges. It is known as one of the most effective business practices. Predicted trust which means trusting an anonymous user is important in online communities. In this paper, the trust was predicted by combining two methods of multiplex network and community detection. In modeling the network in terms of a multiplex network, the relat...

متن کامل

Intrusion Detection using Continuous Time Bayesian Networks

Intrusion detection systems (IDSs) fall into two high-level categories: network-based systems (NIDS) that monitor network behaviors, and host-based systems (HIDS) that monitor system calls. In this work, we present a general technique for both systems. We use anomaly detection, which identifies patterns not conforming to a historic norm. In both types of systems, the rates of change vary dramat...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Research Ideas and Outcomes

سال: 2022

ISSN: ['2367-7163']

DOI: https://doi.org/10.3897/rio.8.e95953